Yijia Yu
08/05/2020
No single asset allocation is resilient to all economic regimes. Different economic regimes call for different asset allocations.
Economic regimes or scenarios (recession, expansion, etc.) can be defined in terms of 4 key economic factors: economic growth, inflationary expectations, monetary policy, and unemployment rate.
Adopting a regime-based asset allocation policy within an overall strategic portfolio may significantly enhance the portfolio’s efficiency.
Developing and implementing a regime-based allocation policy doesn’t require perfect forecasting skills, since anticipating the direction of regime change matters more than predicting the absolute rate of change in the 4 factors.
In this study, I used two unsupervised learning methods, principle component analysis (PCA) and k-means clustering to classify the historical data from 1960 to 2020 into different economic regimes.
PCA: deriving a low-dimensional set of features from a large set of variables.
K-means: partitioning a data set into k distinct, non-overlapping clusters.
There are 13 indices reflecting US economic market performance used in this study. Those indices are listed in the table below, in which you will see the name of each index and what it represent for.
| Index | Description |
|---|---|
| EHGDUS Index | US Real GDP (QoQ, %, SAAR) |
| CPI YOY Index | US CPI (inflation) Urban Consumer YoY NSA |
| CPI CHNG Index | US CPI (inflation) Urban Consumer MoM SA |
| EHUPUS Index | US Unemployment Rate (%) |
| IP CHNG Index | US Industrial Production MoM SA |
| NHSPATOT Index | Private Housing Units Permits Total SAAR (thousands) |
| NFP TCH Index | US Employment on Nonfarm Payrolls Total (SA, Net Monthly Change, thousands) |
| TMNOCHNG Index | US Manufacturing New Orders Total MoM SA |
| LEI TOTL Index | Conference Board US Leading Economic Indicator |
| PITL YOY Index | US Personal Income YoY SA |
| CICRTOT Index | Federal Resrve Consumer Credit Total Net Change SA |
| USCABAL Index | US Nominal Account Balance (Billions USD) |
| M2% YOY Index | Federal Reserve Money Supply M2 YoY % Change |
Since the original data are mixed-frequency time series. To solve this problem, the frequency of all indices used in this study was chosen as quarterly. Besides, to solve the inconsistency of time length, i.e. different starting time of all indices, I used the subset series of the longest common length which starts in 1960 Q1 and ends in 2020 Q1.
All index series are standardized, that is, centering each series (makes its mean equal to 0) and multiplying a constant to each series to make its variances equal to 1. By the data standardization, we may ignore the absolute value of a single series, but focus on the internal relationship within those economic indices.
Some of economic indices are closely correlated with each other.
## The correlation between US Real GDP (QoQ, %, SAAR) and
## US Industrial Production MoM SA is 0.44 .
## The correlation between US Real GDP (QoQ, %, SAAR) and
## US Employment on Nonfarm Payrolls Total (SA, Net Monthly Change, thousands) is 0.516 .
## The correlation between US CPI (inflation) Urban Consumer YoY NSA and
## US CPI (inflation) Urban Consumer MoM SA is 0.719 .
## The correlation between US CPI (inflation) Urban Consumer YoY NSA and
## US Personal Income YoY SA is 0.709 .
## The correlation between Conference Board US Leading Economic Indicator and
## Federal Resrve Consumer Credit Total Net Change SA is 0.662 .
## The correlation between US CPI (inflation) Urban Consumer MoM SA and
## US Personal Income YoY SA is 0.516 .
## The correlation between US Personal Income YoY SA and
## US Nominal Account Balance (Billions USD) is 0.495 .
## The correlation between US Industrial Production MoM SA and
## US Employment on Nonfarm Payrolls Total (SA, Net Monthly Change, thousands) is 0.681 .
## The correlation between US Industrial Production MoM SA and
## US Manufacturing New Orders Total MoM SA is 0.501 .
## The correlation between US Employment on Nonfarm Payrolls Total (SA, Net Monthly Change, thousands) and
## US Manufacturing New Orders Total MoM SA is 0.428 .
According to List of recessions in the US, all quarters involved in the listed US financial recession periods were labeled as “recession”.
According to List of expansions in the US, all quarters involved in the listed US financial expansion periods were labeled as “expansion”.
The plot below shows the contributions of variables in accounting for the variability to the top 2 principal components, that is, the higher contribution (%) of one economic index in this graph, the greater necessity to including this index into our analysis.
Some highly correlated indices result trivial variables which do not contribute variance to first two principle components.
This plot displays the annually data which is projected onto the plane given by the first two PC score vectors.
Individuals with a similar profile are grouped together. Points with a higher cos2 (square cosine, the quality of representation of the PC ) are far away from the center.
Positive correlated variables point to the same side of the plot. Negative correlated variables point to opposite sides of the graph.
Yearly data was labeled as recession/expansion if there is more than two quarters in the year labeled as recession/expansion. If there are exactly two quarters of recession and two quarters of expansion in a year, then it will be grouped in ‘other’.
The score of 2020 is very unusual, far away from scores of all previous data.
After removing 2020, the data become more dispersed in the panel. But some highly correlated variables are clustering together, giving bad classification result.
After removing the least 3 contributing variables, the results become more reasonable.
K-means clustering (MacQueen 1967) is one of the most commonly used unsupervised machine learning algorithm for partitioning a given data set into a set of k groups (i.e. k clusters), where k represents the number of groups pre-specified by the analyst. It classifies objects in multiple groups (i.e., clusters), such that objects within the same cluster are as similar as possible (i.e., high intra-class similarity), whereas objects from different clusters are as dissimilar as possible (i.e., low inter-class similarity).
For the quarterly data, we can keep 11 economic indices since even the contribution of the least important index is above 5%